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One-Component Regular Variation 
and Graphical Modeling of Extremes 


Adrien Hitz and Robin Evans 
University of Oxford 

The problem of inferring the distribution of a random vector given that its norm 
is large requires modeling a homogeneous limiting density. We suggest an approach 
based on graphical models which is suitable for high-dimensional vectors. 

We introduce the notion of one-component regular variation to describe a func¬ 
tion that is regularly varying in its first component. We extend the representation 
and Karamata’s theorem to one-component regularly varying functions, probability 
distributions and densities, and explain why these results are fundamental in mul¬ 
tivariate extreme-value theory. We then generalize Hammersley-Clifford theorem to 
relate asymptotic conditional independence to a factorization of the limiting density, 
and use it to model multivariate tails. 


1 Introduction Regular variati on describes the behavior of some functions when eval¬ 
uated close to infinity 
surable function u on 


Seneta 


1973t |. In the multivariate case, regular variation of a mea- 


as 


' \ {0} with limit v such that u(Al), u(Al) > 0, VA > 0, is defined 
u{tx) 


u{tl) 


v(x), Mx, 


( 1 ) 


Resnick 


19871]. A striking result is that, in this case, the limit is homogeneous, i.e., there 


exists a € M such that v{\x) = A“u(®), VA > 0, V®. 

We suggest an alternative definition: a function u is called regularly varying w.r.t. its 
first component if 

'titfnr 'ii\ 

v{x,y), 


u{tx,y) 


u{t,l) 

where u and v are measurable non-negative functions on (0, oo) x such that u(-, 1 ) > 0 
and u(-,l) > 0. One-component regular variation includes regular variation as a special 
case. For instance, u is regularly varying on M^|_\{0} if and only if uoip~^ is regularly varying 
in its hrst component, where yp{x) = (||a;||,a;/||a;||) and ip~^(r,6) = rO. In Section [H we 
generalize two important results in Karamata’s theory — i.e. the study of regular variation 
— from the univaria te to the multivariate case. The first is the representation theorem 


Bingham et al.l 1989(1, which states that any one-component regularly varying function can 













be written in a specific form. The second, Karamata’s theorem, reveals relations between 
a one-component regularly varying function and its integral. 

Regular variation is, above all, the limit of a fraction. The specific form of the limit 
v{x) = x" in the univariate case comes from the fact that the operation a • x x“ is 
distributive over the division; when regular variation is applied in probability, the fraction 
corresponds to conditioning. The multiplication tx, h owever, is an arbitrary choice o f 
operation that can be replaced by a more general scaling iBingham and Ostaszewskil 2010(]. 
Choosing f*x e->■ T“^{r(t)T(x)} for a diffeomorphism T gives a limit of the form T(x)", 
thus extending regular variation to other decays than the power law. This is not just a 
trivial transformation, but enriches the representation and Karamata’s theorems. 

Extreme-va Ke theory stu dies the distribution of a random vector X in regions far from 
the origin. In 
distribution on Cm 


Basrah 


2000t |. X is called regularly varying if there exists a probability 
= {x € : ||x|| > 1} s.t. i/(AC||.||) > 0 for some A > 1, and 

X\\>t^u, (2) 


t-^x 


where ^ denotes weak convergence. In this case, v is homogeneous, i.e., there exists a > 0 
such that ^{XA) = X~°‘l'{A), for all Borel sets A and A > 1. Basrak’s exact definition 
involves vague convergence to a non-null Radon measure u on ({—oo} U M U {oo})'’* \ {0} 
called the exponent measure. For simplicity and without losing generality, we stick to weak 
convergence between probability distributions throughout the paper. 

We say that a random vector {X, Y) is regularly varying w.r.t. its first component, with 
a limiting probability distribution /r on [l,oo) x if 

{t-^X, Y) \ X >t^ yL, 

and nx the Dirac m ass on 1. Limits of random ve c tors with an extreme com ponent 

have already been studied 


Heffernan and Resnic 




20071] . 


Resnick and Zeber 


2014 ] and it 


is known that pL = P^x H, where Pa is the Pareto distribution of index a > 0 and 77 is a 
probability distribution on In particular, ([2|) is equivalent to one-com ponent regular 

variat ion of ip{X), in which case H corresponds to the angular distribution iBeirlant et al. 


200411 . 


In Section [2l we adapt the representation and Karamata’s theorems for random vari¬ 
ables and, as a result, characterize homogeneous probability measures. In addition, we 
show that if X admits a regularly varying probability density with non-null limit and if 
the sequence is for instance dominated by an integrable function, then 

/t-ix I ||x||>t /V) (3) 
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for a probability density Jy on C'n.n, and thus X is regularly varying. In this case, fy 
is homogeneous of order —a — d. This extends Theorem 2.1 in iHaan and Resnicld 19871], 


which implies regular variation of X when, in addition to regular variation of fx, the 
convergence is uniform, and the limit is bounded on <1711.11. 

Besides describing exceedances over a threshold, regular variation of X is directly re - 
lated to the behavior of maxima of independent copies X^^\ As explained in Resnick 1987 ]. 


if X is non-negative, then it is regularly varying if and only if there exists a sequence 
On —>■ oo such that maxj=i^,,,^„ ^ G for a non-degenerate probability distribution 

G on if the marginals of G are not degenerated, then they are Frechet distributed 

and G is called Type II multivariate extreme-value distribution. The cumulative distri¬ 
bution function of G is exp{—i^dO, where D is the exponent measure. The problem 
of modeling i> or the corresponding copula has received much attention in multivariate 
analysis. Unless d is particularly small, the density of G is approximated by the one of D 
using a Taylor expansion, i.e., up to a constant, by Jy- Hence, modeling both maxima and 
exceedances of X requires modeling a homogeneous density. 

The usual strategy is to write fY{y) = ||yII) and model the angular 
density h = dH satisfying some constraints to ensure that Y has Pareto marginals. 
Suggested rnqdels for h include the a symmetric logistic 


Boldi and Davison 


Engelke et al 


Cooley et al.l 12011 ] , its g eneralization 


Tawn 


Ballani and Schlather 


1990(1, the pairwise beta 


201l|, mixtures of Dirichlet 


2007( 1 and the angular density of the Hiisler-Reiss exponent measure 


2015( 1 . However, they suffer limitations in high-dimensions as the number 


of parameters explodes and may lack flexibility to describe multivariate tails. 

As a consequence of ([3]), any parametric probability density /x(s^) that is regularly 
varying with decay T and satisfies the additional condition induces a homogeneous para¬ 
metric density /y (•; 9) in the limit. If the multivariate marginals and the censored densities 
of fxi‘',9) are available, we directly find the ones of /y(-;0) by passing to the limit. A 
complementary approach pursued from Section [3] is to sir nplify fy using graphical mod - 
els, which have been snccessful in reducing dimensionality I Wainwright and Jordan 


200 . 


In Section 01 we define a new notion called asymptotic conditional independence and we 
generalize Hammersley-Clifford for sequence of densities that factorize in the limit w.r.t. a 
graph. 

Section [5] explains how to use asymptotic graphical models for multivariate exceedances. 
We consider a sequence Xf based on censoring the marginals of X whose absolnte values 
fall below t. Under ([3l) and Jy > 0, the generalized Hammersley-Clifford theorem states 
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that some marginals of are asymptotically conditionally independent given the rest 
of the vector if and only if the censored density of Y is determined by lower dimensional 
marginals Yq, where C are the cliques of a graph. We show that Yc corresponds to the 
limit oi Xc, enabling inference to be performed on each clique separately once the graph 
has been selected. 


1 One-Component Regular Variation for Functions 


Consider the commutative and associative operation 

x-ky := T~^{T{x)T{y)}, x,y e E = [cq, ei), (4) 


where T : E ^ [1, oo) is a diffeomorphism, for cq € M, ei G M U {oo}. When E = [0, oo), 
possible operations include multiplication (T = id), addition (T = exp) and x-ky = ||x,y||p 
for p > 0, T{x) = exp(xP). When E = [0, ei) and ei < oo, T{x) = (1 — gives 

xky = x + y — e^^xy. We call a positive and measurable function u on E regularly varying 
with decay T and limit v if 


u{t k x) 
u{t k eo) 


—> v{x) > 0, 


( 5 ) 


where the arrow stands for pointwise convergence on the entire domain of definition as 
t^ei- It follows that u o T~^ is regul arly vary i ng, an d thus v{x) = T{x)°‘ for some a G 
The convergence moreover is uniform 


Eesnick 


1987 1. We denote ([5]) by u G T-RVq. 


Let us extend this notion to the multivariate setting. Consider two measurable and 
non-negative functions u and v on E x such that u(-,l) > 0, u(-,l) > 0, and a 

non-negative function h on satisfying h{l) = 1. We call 1 the pivot and its choice is 
arbitrary. 


Lemma 1.1 (Characterization). The following are equivalent. 

(i) u{tk x,y)/u{tk eo,l) ^ v{x,y), 

(a) u{-, 1) G T-RVa and u{t, y)/u{t, 1) — h{y), 

(Hi) u{-, y) G T-RVa, Vr/ s.t. h{y) > 0, and u{t, y)/u{t, 1) —)• h{y), 
(iv) u{tk x,y)/u{tk eo,l) ^ T{x)^ h{y) uniformly in x. 
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Proof, ([il) =► ([nl): to derive the first condition, set y = 1; for the second, x = oq. (jn|)^(|m|): 
let y s.t. h{y) > 0. Since * cq, ★ eo, 1) —> h{y), u{t-keQ,y) > 0 for t sufficiently 

large. It follows that 

u(t-kx,y) u{t-kx,y) u{t-kx,l) u{t-k eo,l) 


u{t -k Co, y) u{t k X, 1) u{t k eo, 1) u{t k Cq, y) 
mil ^ (|iv|): as h(l) > 0, 

u{tkx,y) u{t k X,y) u{t k X,l) 


T{x) 


u{tkeQ,l) u{tk X,l) u{tk eo,l) 
and the convergence is uniform in x. ()i^ ^ (jj): clear. 


T(x)“h(y), 


□ 


We say that u is regularly varying in its first component if hold, written u € 

T-RV* (h), or simply when T = id. Con dition u{-,y) G RV^, Vy, corresponds to the 

if it is assumed further that the convergence 


Meerschaert I 


uniform regular variation of 
is uniform in y. 

We now generaliz e the representatio n theorem for univariat e regular variation (see 


Bingham et al.l 1989l | in the case T = id. 


Jaros and Kusano 


2004 1 otherwise). 


Theorem 1.2 (Multivariate Representation Theorem). It holds u G T-RV^{h) if and only 

if 


u{x, y) = c{x) exp 


'eo 




for some measurable functions s.t. c{t) —>■ c > 0, a{t) —>■ a, q{t, y) —>• h{y) as t\ ei. 
As a consequence, Ve > 0, 3ci, C 2 > 0 such that 

cir(x)““"/i(y) < u{x,y) < C 2 T(x)"+"/i(y), 
for X sufficiently large and y satisfying h{y) 0. 


( 6 ) 


Proof. For the direct implication, write u{x, y) = u{x, y)lu{x., 1) tt(x, 1). Lemma [LT] gives 
q{t,y) := u{t,y)/u{t, 1) h{y) and u{x, 1) G T-RVq,. The conclusion follows by applying 
the representation theorem on the regularly varying function u{T~^{x), 1}. For the reverse, 
consider z = tk z and dz = T{t)T'{z)/T'{t k z) to find 

pt-kX pX 

^)T'{z)/T{z)dz = / a{tkz)T'{z)/T{z)dz^a\ogT{x), 

J en 


a{z 


' tkEQ 


and thus u{t k x, y)/u{t k eo, 1) —>■ T{x)°^h{y). 


□ 
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Another important result is Karamata’s t heorem, which relates the regular variation 


of a univariate function to that of its integral iBingham et alJ 1989]. We generalize it for 
one-component regular variation, only treating the case of a negative power index —a for 
a > 0. Suppose that U{x,y) := u{z,y)dz exists Vx,y (it exists for eo sufficiently large 
if u{x,y)/T'{x) G as a consequence of (iBll ). 


Theorem 1.3 (Multivariate Karamata’s Theorem). It holds 


u{x,y) 


GT-Rr^_,{h) 


T(t) u(t,y) 


ah{y). 


T'{x) ^ T'{t)U{t,l) 

In this case, U € T-KV^_^{h), and its representation has coefficient c{x) = U{eo, 1). 


Proof. To prove the direct implication, use the change of variable z = t-k z to find 

T{t-keo) u{t-keo,y) _ f u{t-kz,l) T'jt-keo) , , ) u{t-keo,y) 

Tft-keo) f^^^u{z,l)dz \Jeo u{tk eo,l) T'{t-k z) ^ u(tkeo,l) 

T{z)~°‘~^T'{z)dz'^ h{y) = ah{y), 

because tt(-, 1)/T'(-) G T-RV_a_i and u{t k eQ,y)/u{t k cq,!) —>■ h{y) from Lemma fl.ll 
Limit and integral can be exchanged because cT{z)~°~^~^''T'(z) dominates the integrand 
for some c > 0, e G (0, a) and t sufficiently large thanks to Q. 

For the reverse implication, let a{x) := {u{x, 1)/U{x, 1)}/ {T'{x)/T{x)}, which satisfies 
a{t) —)• a. Integrate u{z,l)/U{z,l) = a{z)T'{z)/T{z) from both sides between eo and 
X to obtain C7(x,l) = (7(eo, 1) exp |—a( 2 :)T'( 2 ;)/T( 2 ;)fiz| . This is the representation 
of a T-regularly varying function with c{x) = U{eo,l) thanks to Theorem 11.21 Hence, 
T'{t k eo)/T'{t k x) u{t k X, y)/u{t k eo, 1) equals 


T'{t k eo) U{t k eo,l) T{t k x) u{t k x,y) T{tkeo) U{tkx,l) 
T{tkeo)u{tkeo,l) T'{t k x) U{tk x,l) T{tkx) U{tkeo,I) 


T{x)-^-^h{y), 


which ends the proof of the equivalence in (|1.3I) . Moreover, U G T-V0Jci{h) because 


U{tkx,y) 
U{tkeo, 1) 


= U{tkeo, 1) 


roo 

' / 

J t-kX 


u{x, y)dx 


u{t k eo, 1) T(t k eo) 
U{t k Cq, I) P'{t k eo) 



u{t k z, y) T'{t k eo) 
u{t k eo, 1) T'(t k z) 


T'(z)dz ^T(x)-‘^h(y). 


□ 


For instance, consider the Gaussian distribution J\f{fi,a‘^) with probability density (f 
and survival function $. Since 2~^t~^(f){t)/^{t) —> Theorem [T3] gives (j){x)/T'{x) G 
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T-RV_ 2 - 1 ct- 2 _i and $ E T-RV_ 2 -i(j -2 for T{x) = exp(x^). Let / be the probability density 
of the log-Cauchy distribution and F its survival function. As tlogt f(t)/F{t) —)• 1, it 
follows that xf{x) E log-RV_ 2 , and F E log-RV_i. 

We now extend the standard multivariate regular variation in ([T]) to general decays. 
We denote the sign of x E M by cia; E { — 1,0,1} and <Tx = ■ ■ ■ jO'xa) when x E M'^. 

Operations between vectors are done componentwise, x*y := (xl) *y, and bq := eol. We 
say that u : —>■ [0, oo) satisfying m(A1) > 0, VA E E, is regularly varying with decay T if 
there exists v such that u(Al) > 0, VA E E, and 


u{{t-k |a;|)<Ta;} 
u{t -k Bq) 




(7) 


on F := {(—Cl, — cq) U {0} U [cq, ei)}'^ \ {0}; by convention, {t * |x|)la: = 0 if x = 0. 

Multivariate regular variation is easily expressed in terms of one-component regular 
variation by introducing the following change of variable. We say that (/> : C —>■ (0, oo) x fl, 
for C defines a radial system of eoordinates if it has the form 


cf : X {r{x),e{x)}, 4> ^ : {r,e) {r k\e ( 8 ) 


where r(-) and 0{-) satisfy r{(A * |x|)<T 3 .} = Xkr{x), 0{(A * |x|)<T 3 .} = 6{x), VA E E. 
Examples include spherical coordinates on and pseudo-polar coordinates defined by 
r{x) = ||a;||, a norm, and 0{x) = on When T = exp, the latter translates 

into r(x) = log(||e®||), 9{x) = x — r(x). It becomes clear that u is regularly varying if and 
only if g{r, 6) := 0)} is regularly varying w.r.t. its first component since 


9 {t*r,e) ^ |a;|)cr,^} 

g{tklr,le) u{tkBQ) 


(9) 


and in this case. Lemma 1 1.1 1 gives v{x) = T{r(x)}"/i{0(x)}, where 1,. = r(eo) and le = 
0(eo). In particular, v is homogeneous of order a, i.e., u{(A*|x|)cra,} = T(A)“u(a;), VA E E, 
and we write d?]) as u E T-EVaiv). 


2 One-Component Regular Variation for Probability Distri¬ 
butions 

So far, one-component regular variation has been treated for functions; we develop it further 
for distributions. A specific representation is found for regularly varying random vectors. 
Their limits moreover form the class of distributions that are homogeneous w.r.t. their first 
component. In a subsequent result, we show that one-component regular variation of the 
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probability density implies, under an extra condition, one-component regular variation of 
the distribution. 


For simplicity, we treat the case T = id. Let (X, "K) be a random variable with probabil¬ 
ity distribution fj, on [l,oo) x We call F{x,y) = y,{[x,oo) x (—oo,y]} the x-survival 

function. Subscripts X or Y refer to the corres ponding marginal distribution. Suppose 


that Fx > 0. We are interested in the weak limit iRillingslevI |l9^ of {t ^X,Y) \ X >t 


equivalent to the weak limit of its distribution B)/Fx{t), and of its x-survival function 
F{tx,y)/Fx{t). Let H and u be two probability distributions on and E respectively 
such that nx 7^ We denote by Pa the Pareto distribution on [1, oo) with shape a > 0. 


Theorem 1.4 (One-Component Regular Variation for Distributions). The following are 
equivalent. 

(i) {t-^X,Y) \ X>t^v, 

(ii) Y \ X>t^ H and Y^X \ X>t^Pa, 

(Hi) F{x,y) = c{x) exp J^^a{z)z~^dz'^ Q{y \ x), for measurable a{t) —>■ a, c{t) —>■ c 
and a conditional cumulative distribution function Q{y | t) ^ H, 

(iv) {t-^X, Y)\X>t^Pa X H, 


We say that {X, Y ) is regularly varying w.r. t. its first component, written {X, T) € 


(livl) can be found in 

(jinds 

coe: et al. 

Heffernan and Resnick 

2007 

1 (Propo- 


for in stance F{x) = e~^. The equivalence of (ji|) anc 
2014l | (Theorem 3.1 with {A, (x, y)} (Ax, y)) or in 
sition 2 with a = 1, /3 = 0). Our contribution is the representation in (juil) and a short 
proof. 


Basrak 


2000 1 ■ t~^X \ X > t ^ ux implies nx = P-a 

H, write F{x, y) = 
Fx{x)FY\x>x{y I x) and apply the representation theorem on Fx- diull => (livll : 


Proof. (ED ^ (jul): By Theorem 2.1.4 in 
for some a > 0. (ElD ^ (EIlD- since Fx G RV_q and FY\x>t 


F{tx,y) c{tx) 
Fx{t) c{f) 


exp 




a(z)z ^dz D Q(y | tx) ^ x °^H{y), 


because f^^a(z)z ^dz = a(tz)z ^dz—>-alogx. (E3 


clear. 


□ 


Suppose further that (X, Y) admits a probability density / w.r.t. the Lebesgue measure 
such that fx and /(•, 1) are positive. We want to know what the relation is between regular 
variation of / and of (V, Y ). 
























We start by answering the question in the univariate case. From Karamata’s theorem 
fTheorem II.311 . fx G RV_q,_i implies Fx G RV_q, and its representation has coefficient 
c(-) = 1. Thus, X G RV_a and its representation now determines exactly a probability dis¬ 
tribution: Fx{x) = exp {— a{z)z~^dz'^ , Vx > 1, for a positive and measurable a{t) —>■ 

a > 0. The following example shows that regular variation of X does not imply regular 
variation of fx in general. Equalities and pointwise convergences between densities hold 
almost everywhere. 

Example 1.5 (Regular Variation of the Distribution but not of the Density). Let Fx{x) = 
exp { —on [l,oo) and a{x) = sin(x) + 2. Since Fx{x) = c{x)x~‘^ for c{x) = 
ex.p{— y~^ sin y dy} satisfying c{t) —>■ 1, the representation theorem gives Fx G RV_ 2 , 
and thus X G RV_ 2 . However, tfx{t)/Fx{t) = a{t) 2, and from Karamata’s theorem 
fx is not regularly varying. 


We now provide an answer in the multivariate case. If / G RV’’’(u) such that the limit v 
is non-null and the sequence is dominated by an integrable function, then by the dominated 
convergence theorem 

ft-X,Y\xMx,y) = ^ c-^v{x,y), ( 10 ) 

where c* = f{tx, y)/f{t, l)dxdy c = v{x, y)dxdy < oo. Alternatively, 

we can assume v integrable and the sequence monotone instead of dominated. Let a > 0 
and LI be a probability distribution on with density h satisfying /i(l) > 0. According 
to Lemma ll.il the limit in (jinj] has the form ax~°^~^h{y), and thus {X,Y) G RV_q,(LL). 
This is an important result: we can guarantee regular variation of (V, Y) simply by com¬ 
puting the limit of f{tx,y)/f{t, 1), a useful approach when F is intractable or the weak 
convergences in Theorem 11.41 are difficult to check. Whereas monotonicity or domination 
is sufficient to obtain (jlOp . our next result reveals necessary and sufficient conditions. 


Theorem 1.6 (One-Component Regular Variation for Densities). The following are equiv¬ 
alent. 


(i) /y|x(l 
(i-i-) fY\x{y 
(m) fY\x{y 
(iv) fY\x{y 


t) ^ h{l) and f G RW,_i{h(.)/h(l)}, 

t) h{y) and fx G RV-a-i, 

t) h{y) and fx\Y=i € RV-a-i, 

t) h{y) and fx\Y=y G RV-a-i, ^y s.t. h{y) > 0, 


9 



(v) f{x,y) = a(x)x ^ exp ^ — a{z)z ^dz'^ q{y \ x), for a positive and measurable 
a{t) —>■ a and a eonditional probability density q{y \ t) —)• h{y), Mx > 1, Vy, 

(vi) ft-^x,Y\x>t{x,y) ax-^-^h{y). 

In this case, {X,Y) € RVf^{H). 

In (jl|), the first condition is necessary and can be replaced by tf{t,l)/F{t) —>■ ah{l), 
and in ([nl) by tf{t,y)/Fx{t) ah{y)/h{l). 


Proof. (Ii|) straightforward; use Lemma 11.11 for (InTll ^ (Ir^ and (p^ => (ED- dUD 

=> (j^; write f{x,y) = fx{x)fY\x{y I x). On the one hand, fx £ RV_q, and thus from 
Karamata’s theorem iTheorem ll.SD Fx has a representation with c(-) = 1. Differentiate this 
representation to find one for fx- On the other hand, fvixiv I t) h{y) by assumption. 
Altogether, this gives (j^, a well-defined probability density because e and q are measurable, 
and Tonelli’s theorem ensures it integrates to 1. (j^ => (j^: integrate (j^ to find F{x) = 
exp a{z)z~^dz^ , and thus tf{tx,y)/Fx{t) = a{tx)x~^ exp — a{tz)z~^dz^ q{y \ 

tx) ax~°‘~^h{y). ^ (jl]): first, 


f{tx,y) ^ tf{tx,y) Fxjt) _a_i h{y) 
/(LI) Fx{t) tf{t,l) ^ h{l)' 


Durrett 


Second, apply Scheffe’s Lemma 
in particular, X G RV_q,. Karamata’s theorem ensures that 


2010 | on (|^ to obtain {X,Y) G RVLq(LI) and, 


/y|x(l I i) 


tfjt, 1) Fxjt) 
Fx{t) tfx{t) 


h{l). 


□ 


The following example illustrates the use of Theorem 11.61 which holds also when the 
convergence is not uniform in y. 


Example 1.7 (Illustration of Theorem II.6p . Consider 


f{x,y) 


fx 2 if y G (0,® 1), 
else. 


on [1,oo) X (0,1]. 


The convergence f{tx,y)/f{t, 1) —)■ x~‘^y gives / G RV_ 2 {/i(-)//i(l)} for the cdf H{y) = 
with probability density h{y) = 2y on (0,1]. Since f{t, l)/fx{t) —)• 2 = h{l) (condition ([I]), 
Theorem 1 1.6p . {X,Y) G RV_a(Ll), i.e., F{tx,y)/Fx{t) —)• x~^y‘^. The same conclusion is 
drawn by showing fx G RV _2 and /y|x(y I 2y = h{y) (condition (jn]), Theorem 11.61) . 
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The results of this section are easily extended to hold for the arbitrary decay T defined 
in (0). We write {X,Y) G T-EV^^{H) if {T{X),Y} G RV^„(F). From the continuous 
mapping theorem, this is equivalent to /it* u, where /iA*(^, B) := fi{\ -k A,B )/(A * E) 
is a probability distribution on E x for every A G -B. In this case, u = Pa oT x H, and 
in particular I'x has survival function We say that /i is homogeneous w.r.t. its 

first component if ^{X-k A, B) = T{X)~°‘^{A, B), VA G E. Theorem 11.41 characterizes such 
distributions. 


Corollary 1.8 (Homogeneous Distributions). The following are equivalent: /i is homoge¬ 
neous of order —a w.r.t. its first component; F is homogeneous of order —a w.r.t. its first 
component; /iA* = /i, VA G Bi; X _LL and X ~ Pq, o T; (under its existence) fx,Y is 
homogeneous of order —a — 1 w.r.t. its first component. 


We now focus on one-component regular variation when the first component is the 
radius of a random vector X and the rest of the vector is the angle — note the change in 
the notation. Let (i?, 0) = 4>{X) = {r(X),0(X)} be its expression in radial coordinates 
defined in ([ 8 ]) with T = id and suppose that Pr > 0. Let a > 0 and Y ^ v, a probability 
distribution on Cr = {r{x) > 1}, such that r(Y) 9 ^ di. 

Analogously to functions, X is multivariate regularly varying in the sense ([2]) if and 
only if (i?, 0) is one-component regularly varying. This relation is well-known when, for 
instance, 4> is the change of variables into pseudo polar coordinates 


Basrak et al. 


200211 . 


Proposition 1.9 (Multivariate and One-Component Regular Variation). 


t-^X \X ^tCr^n on Cr 


{R,Q) £ RV-a{H) on [l,oo)xH. (11) 


In this case, v is homogeneous of order —a, i.e., v{XA) = A °‘u{A), VA > 1. 

Proof. We only prove the direct implication; the reverse goes similarly. Since is ho¬ 
mogeneous of order 1 w.r.t. its first component, write t~^R, @ \ R > t = (l)(t~^X) \ R >t, 
which converges weakly to a probability distribution by the continuous mapping theorem. 
Hence, {R, 0) G RV_a(H). To show the homogeneity of u, let Y = (j)~^{R*, 0*) ~ u and 
use Corollary 11.81 

u{XA) = Pt{(I)-\X-^R*, 0*) G a I P* > A}Pr(P* > A) 

= A-“Pr{(/)-i(P*, 0*) G A} = A-“i/(A), VA > 1. 


□ 
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We denote the multivariate regular variation in (fTT|) by X € RV_q,(z/) or RV_q,(W). 
Regular variation w.r.t. another radial function f(-) is equivalent if there exists ci,C 2 > 0 
such that cir(a;) < f{x) < C 2 r{x), \/x, and in this case, the limits are related as follows: 
Vr{A) = Vf{c2A)/vf{c2Cr). 

Suppose that X admits a probability density fx w.r.t. the Lebesgue measure and let 
Jy be a probability on Cr such that /x(Al),/y (Al) > 0, VA > 1. Similarly to what we 
did in the previous section, we now study the relation between regular variation of fx in 
the sense © and of X. If fx S RV(u) for some non-null function v and the sequence is 
dominated by an integrable function, then 


/t-ix|XetCr(®) 


fxjtx) 


( f fxjtx) 

\Jcr fxjtl) 



-1 


vjx) 

Icr v{x)dx 


fvjx), 


( 12 ) 


thus the limiting probability density fy is homogeneo us and 
RV_q,(X) on Cr- As a comparison, Proposition 5.20 in IR esnick 


r y Sch effe’s Lemma X G 


19871 ] has the same con¬ 


clusion, however, its assumptions require the convergence to be uniform and the limiting 
density to be bounded on G : rjx) = 1}. 

Relations m and m are easily extended in the case of the general decay T. We 
write X G T-BV^^jH) if {T(Xi),... ,T(Xrf)} G RV5?iQ,(i^) or equivalently /xt* v, where 
:= fijX-kA) / fj.jX-k Cr) defines a probability distribution on Cr, VA G E. In this case, 
u is homogeneous of order —a, i.e., fijX-kA) = T(A)““/x(A), VA G E. 


3 Multivariate Exceedances 


Extreme-value theory aims at describing the distribution of an M'^-random vector X when 
at least one of its marginal is large. Typically, it is assumed that X is regularly varying on 

C^ll- 

density fy, know n to be 


= I® G : ||a;||oo}, i.e., t \ ||X||oo >t^Y, and that Y admits a probability 


-lx I IIXIU >t 
r omog eneous. This excludes the case of asymptotically inde¬ 


pendent marginals iResnicld 19871 ] . which is to be treated separately. The tail distribution 
of X is commonly i nferred by censoring the marginals whose absolute values fall under 


some large threshold iLedford and TawnI 19961 ] , requiring a model for fy and its censored 
versions. In this section, we first suggest a general procedure to obtain parametric homo¬ 
geneous densities whose multivariate marginals and censored densities are known. Second, 
we explain how to simplify fy when d is large using graphical models. 

We start by asking wether the marginals of a regularly varying vector are also regularly 
varying and, in this case, if the marginals of the limit are the limits of the marginals. The 
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following lemma provides an answer. Let A C {1,..., d}, C|"j^|| := {xa S : ||®yi||oo > 

1} and V ^ 0. 

Lemma 1.10 (Regular Variation of the Marginals). Suppose that 


Pr(6Vj > t \ ||X||oo > t) —>■ Cfei > 0, 6 € {—1,1}, i = 1,... ,d. (13) 

IfXe RV.a{Y) on q|.||^, then Xa G RV.^{Ya \ ||V^||oo > 1) on qf||^. If fx G 
RV-a{v) and the sequence is either monotone and v is integrahle or is dominated by an 
integrable function, then ||XA||oc>t ^ /i^aI I V.4l|oo>i- 

Proof. Under (fT3|) . Pr(||X^||oo > t \ ||-Xi||oo > t) ^ c > 0. For all i/-continuous Borel set 
B, 

PT{XA€tB) _Pi{XA€tB) Pr(||X|U>t) 

PrdlX^lloo >t) Pr(||X||oo >t)Pr{X Gt(C|f||^ xMl^^l)}’ 

and converges to Pr(V^ G B \ ||Vyi||oo > 1), proving the first part of the lemma. The 
second part is a direct consequence of the monotone or dominated convergence theorem. □ 

This result is specific to the Pareto decay (T = id): F{x,y) = 2/(el*l + e^^l) is homo¬ 
geneous on w.r.t. addition but its marginal Fx{x) = 2/(el^l + 1), although regularly 
varying, is not homogeneous. 


Example 1.11 (Limiting Density of a Multivariate Student Distribution). Let X be mul¬ 
tivariate Student f-distributed with mean 0 and dispersion matrix S C The sequence 

fx{tx)/fx{tl) is monotone and converges to v{x)/v{l) for 

v{x) = , 


which integrates to c < oo on q|.||j,^. As explained in (fT^ . ft-^x] ||x||oo>t(2/) ^ ^^{y) =: 

fviy), thus X G RV_q(V). In addition, the censored limiting density is the limit of the 
censored density, that is /i-ixA,|XA|<t| ||x||oo>i and Lemma [Lini ensures 

that the marginals of the limit are the limits of the marginals, i.e., ft-iXA\\\XA\\ao>t Iya- 
These quantities can thus be computed from the censored densities and marginals of the 
multivariate Student by passing to the limit. Up to a constant and a transformation of 


the univariate marginals, fY coin cides wi t 
extremal-f distribution derived bv iRibatetl 


I the density of the exponent measure of the 


201311 . 


As the previous example suggests, we can obtain parametric families for homogeneous 
distributions by computing limits of well-known multivariate distributions that are reg¬ 
ularly varying with decay T. We now focus on a complementary approach for modeling 
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Jy using graphical models. In short, graphical models offer a way to simplify a joint 
density by assuming some conditional independence between the marginals. Conditional 
indep endence — an d dependence — between marginals is only meaningful on a product 
20011 ]. Since Y has values in C'li.ii , the range of Yi depends on the other 


Dawid 


space 

marginals. To remedy this problem, we work with the censored vector 




(^il|yi|>i, • • • 


0 


with probability p, 
with probability 1 — p, 


(14) 


whose values lie in = {(oo,—1] x {0} x [l,oo)}‘^, where p = Pr(||X||oo > u) € (0,1) 
for some large threshold u. This stays in line with statistical inference, which typically 
estimates a censored version of the limiting distribution. 

The Hammersley-Clifford theorem states that if /y^c is positive, it factoriz es w.r.t. a 


gra^h g = ({1, ...,d},E) if and only if Yf^ _1L Yf \ V(i, j) ^ E 


Lauritzen 


1996(1 . The latter denotes conditional independence between Yf' and Y^ given the rest of 


the vector. Let us illustrate an application of the theorem. 


Example 1.12 (Factorization of the Limiting Density). Let Y have values in with 

fYi 2 \\\Yi 2 \\o.Mx,y) = U^ + y)~^ = fY23\\\Y23\\ooMx,y) andp = Pr(||Ei 2 ||oo > !)• From 
the Hammersley-Clifford theorem, Yf' _LL Y^ \ Y^ if and only if 


fYcix,y,z) 


fYgi^^y)fYgiy^^) 

fvfiy) 


y{x,y,z) G ({0} U [l,oo))^ 


where /y-c (x,0) = |p{x ^ — (a: -|- 1) /yc(x) = etc. 

From a statistical perspective, the distribution of Y'^ is to be estimated from samples of 
X. To that end, we consider a sequence Xf which is function of X and satisfies Xf ^ Y^ 
if X € RV_a(E). Crucially, the sequence of marginals {X^)£) for D C {1,... , d} coincides 
with the censored sequence (Xt^n)^ converging to Yf^. It is defined for all x G 
s.t. I®] > 1, H C D \ 0, by 


Pr(cr, 


vC 

’^^t,Dr\A 


> |®|,X 


c 

t,DnA<^ 


= 0) = p 


PT{crxXDnA > t\x\, \X 




<t) 


Pr(||X||oo >t) 


(15) 


and Pr(X^^ = 0) = 1 — pPr(||X£)||oo > t \ ||X||oo > t), where ax denotes the sign 
of X. Both X^ and Y^ admit probability densities f-^c and /y^c w.r.t. the measure 
po{Ai X ...X Ad) := ' {UieD-^i) on where A'^ is the 

d-dimensional Lebesgue measure and dg the Dirac measure on 0 G 


14 









We want to find conditions on X that impose a factorization of /^c. This brings 
us to extend the Hammersley-Clifford theorem for densities that factorize in the limit. 
Conditional independence is translated into asymptotic conditional independence, a notion 
little mentioned in the literature. 


4 Asymptotic Graphical Models 


Let Xt for t > 1 and X be R'^-valued random vectors with almost everywhere (a.e.) contin¬ 
uous probability densities fxt and fx w.r.t. a base measure typically the Lebesgue, the 
counting measure, or a combination of the two. For disjoint subsets A,B,C C {1 ,..., d}, 
we call the marginals Xt^A and Xt^B asymptotically conditionally independent with respect 
to Xc,t, written Xt,A-^Xt,B \ if 


{fj 


t, ABC 


fXt,C fXtAcfXt,BC )dfio ^ 0 . 


(16) 


Here, /xt^/Uo stands for weak convergence of i.e., the convergence of f gfxt(^lJ ‘0 for 
every g a.e. continuous and bound ed. When Xt = Xi, Vt, (fT6]l coincides with conditional 
independence of random variables 


Lauritzen 


199611 . 


Let Q = iy, E) be an undirected graph with set of nodes V = {1,... ,d] and set of edges 
E. We say that Xt satisfies the asymptotic Markov properties according to Q if {i, j} ^ E 
implies Xt^idLXt^j \ Xt^v\{i,j}- Moreover, we say that fxt asymptotically factorizes w.r.t. 
Q if Xt ^ X and fx factorizes w.r.t. Q. 


Theorem 1.13 (Asymptotic Hammersley-Clifford Theorem). Suppose Xt X, fx > 0 

and fxA bounded \/A CV a.e. The error et := fxt ~ fx satisfies 

{^t,vet,v\{i,j} — ^t,v\{i}^t,v\{j})dTo ^0) V(i,j) G E', (17) 


if and only if the following are equivalent: 

(i) fxt asymptotically factorizes w.r.t. Q, 

(ii) X satisfies the Markov properties according to Q, 

(Hi) Xt satisfies the asymptotic Markov properties according to Q. 

Proof. (0) dn]): apply Hammersley and Clifford theorem. ([n|) (|iiil) : it suffices to show 

relations of the form fxYzfz = fxzfvz a.e. if and only if {fxtYtZtfzt-fxtZtfYtzMho ^ 
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0. Rewrite the latter as 


ifxYzfz - fxzfYz)dfJ.o + {eXt,Yt,Zt(^Zt - f^Xt,Zt(^Yt,Zt)dfJ-0 (18) 

+iext,Yt,Ztfz + eztfx,Y,z - ext,ZtfY,z - eYt,Ztfx,z)dfJ.o ^ 0. 


Since (Xt,Yt,Zt) (X,Y,Z), et converges weakly to 0 and so does the last term in (fTSl) 
because fx,Y,z a-iid its marginals are a.e. continuous and bounded. Moreover, the middle 
term vanishes by assumption (|17p . proving the equivalence. □ 


As mentioned in iLauritzen 


199fit |. Example 3.11, the Markov property is in general not 


satisfied under weak convergence; Theorem 11.131 provides conditions under which it is. If 
fxt fx and the sequence is dominated by an integrable function, then fx^ ^ Jxa 
for all A C {1 ,..., d}. Hence, fx need not be bounded and, crucially, (fT7|) holds directly. 
Similarly, if fxt fx in L 2 , then fx^ ^ Ixa in L 2 by Jensen’s inequality, and we apply 
Holder’s inequality to show that (fTTll is satished. 


5 Asymptotic Graphical Modeling of Exceedances 


We explain how to model the exceedances of a random vector X with probability density 
fx using asymptotic graphical models. Let ^ be a decomposable graph with set of cliques 
C and set of intersections between them T>. Suppose that fx G RV_q(u) for u ^ 0 and 
that the sequence is either monotone and v is integrable or dominated by an integrable 
function. As seen in (fTOl) . f^-ix \ ||x||oo>t a-iid we assume that the probability density 

fx is positive. It follows that f^c —>■ fYC > 0, where and are defined in Section 
[3l From Theorem 11.131 Xf satisfies the asymptotic Markov properties according to G if 
and only if fyc factorizes w.r.t. G, i.e.. 


ncecAg(^c) 

/yc( 2 /) = - 1 

llDeV fv^iyo) 

Hence, for all A C {1,... , d} and s.t. |y^| > 1, 




(19) 


fYA,\Y'X\<liyA) 


ncgC:CnA 7^0 / jycnA) ricec dc,A 

YiDeV-.DnA^Ill f (yD ha) WD&cdD,A 


where fiVsnA) denotes the density of YY snA= < 


PS :=pPr(||F5||oo > 1) 
1 -PS 


1 I ll^slloo > 1 and 

if 5 n A / 0, 
else, 


( 20 ) 
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for all 5 € C U P. By positivity of fy, (fT3|) is satisfied and thus Lemma [1.101 gives Xs £ 
RV_q( 1^5 I || 1 ^ 5 l|oo > 1) on C'|'^.||^ with censored limiting density fiVsnA)- This is a key 
step as it means that the latter can be estimated from samples of X 5 . 

This suggests the following strategy to model exceedances of an i.i.d. sample of X. 
First, transform the positive and negative part of each marginal that exceeds a large 
threshold to unit Pareto to obtain the censored vector Xf. Second, select a decom- 
posable graph Q by testing conditional independence between the marginals of Xi = 
(Xil|Xi|>i, • • •, Xdl\Xa\>i)- Third, for all 5 G C U P, select a model for the homogeneous 
density fy^i ||yg||>i with unit Pareto marginals; estimate it using a sample from X^ g, and 
estimate ps ~ Pr(||Xi_ 5 ||oo > 1). This determines the asymptotic approximation of f^c 
in ( 1201 ) . 


In low dimensions, there is a rich class of models for extreme-value distributions iGudendorf and Seeers 


2010l |. If the later have Frechet marginals, we have seen that their exponent measure den¬ 


sity, when it exists, is homogeneous and thus provides a model for fy^ | ||ys||oo>i- ^ para¬ 

metric approach, a refinement is to impose the parameters of fyfj{-]0c) to be consistent 
with the ones of fyj^{-] Od) whenever D G C and to estimate them from sufficient statistics; 
Example 11.111 showed a way to find consistent par ametric families. An ext r eme-value dis¬ 
tribut ion often used in practice is the Hiisler-Reiss iHiisler and Reiss! 19891] , lEngelke et al. 


201 ,^ . 


Example 1.14 (Hiisler-Reiss and Conditional Independence). The Hiisler-Reiss exponent 
measure has homogeneous probability density fy\Y^>iiy) = y^‘^4>{y-k I Vk) on Ck = 
{^ > 0 : yfc > 1}, Vfc = 1,... ,4, where cj) is the density of the multivariate log-normal 
distribution on with mean (logyfc ~ and covariance matrix where 

= ^{Fjfc -|- Fjfc — Fjj} 2 <ij<rf, and F^- = Sjj -|- T,jj — 2Sjj is the incremental variance 
defined by a correlation matrix S C The marginals Ya \ ||E^||oo > 1 and the 

censored densities can be computed. Moreover, conditional independence corresponds to 
a specific constraint on the covariance matrix, analogously to the Gaussian case: 


Yi ALYIY 




on Ck 






F jk = 0. 


( 21 ) 


A factorization of the homogeneous limiting density induces a specific factorization of 
its angular density. As explained in the next example, this gives a way to build a high¬ 
dimensional angular density satisfying the marginal constraints. 


17 














Example 1.15 (Factorization of the Angular Density). In (I20p . Jyc is expressed in terms 
of homogeneous lower dimensional densities that can be written as 


fYslWYsWooMvs) = kr{ys) hs{e{ys)}J^^{ys), SgCUV, 


where 4>s ■ ys is the radial system of coordinates defined in ([ 8 ]) with Jacobian 

determinant > 0 , hs is the angular probability density on D 5 , and k the normalizing 
constant. When no marginal is censored, fyc is homogeneous and its angular density reads 

r(0c)-“-' hc{0c)J^ci^c) 


h{e) = 


cec 


k n 

YIdgv 'r{0D)~°‘~^hD{0D) 


( 22 ) 


The censored case goes similarly. We now derive the constraint that a parametric family 
for angular densities must satisfy to be consistent. Consistency means that for all A = 
Au{i} C{1,...,4, 

/^aIII’^aI|oo>i(2/a) = k fY^{yA)dyi, 

JR 

and translates into 


hAiO) = k 


[ r{e/{e),z} 

Jr 


-“-1 hx [e{e-j}{e),z}] 


(23) 


by a change of variable 2 ; = yi/r{yA) and setting y = r{y a)6 In particular, when 
r{x) = II • 111 , 9{x) = xjrix)^ and ® > 0, (f25|l becomes 


1*00 

hA{9) = k / {l — w)°'^‘^~‘^hj^{{l — w)0,w}dw. 
JO 


using the change of variable w = zl{z + 1). iBallani and Schlathen 201 ll | showed that the 
asymmetric logistic model satisfies this constraint and is thus a consistent family — its 
censored density is however unattractive. 

Asymptotic graphical modeling for extremes necessitates testing asymptotic conditional 
independence, which is difficult in a non-parametric setting. As a first approximation, we 
can easily test conditional indep endence of the b i nary vector B = (l|xp>t,..., l|jVd|>t) 
or a more refined discretization iNaearaian et al.l 2013l |. Further efforts are needed to 


build a test for tail conditional independence in the parametric continuous case — for 
insta nce, by relying upon (1211) . Lastly, we mention the success of the Gaussian graphical 
lasso 


Friedman et al 


20081 ]. which has been able to tackle high-dimensional problems by 


imposing sparsity of the inverse covariance matrix. Future work could allow a similar 
approach for the Hiissler-Reiss distribution. 
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